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ATGGCGCGGGCGGACACGGGCCGCGGGCTCCTGGTGCTGACCTTCTGCCTGCTGTCCGCG 6 0 
CGCGGGGAGCTGCCATTGCCCCAGGAGACAACTGTCAAGCTGAGCTGTGATGAGGGACCC 120 
CTGCAAGTGATCCTGGGCCCTGAGCAGGCTGTGGTGCTGGACTGCACTTTGGGGGCTACA 18 0 
GCTGCTGGGCCTCCGACCAGGGTGACATGGAGCAAGGATGGAGACACTGTACTAGAGCAT 24 0 
GAGAACCTGCACCTGCTACCCAATGGCTCCCTGTGGCTGTCCTCACCCCTAGAGCAAGAA 300 
GACAGCGATGATGAGGAAGCTCTTAGGATCTGGAAGGTCACTGAGGGCAGCTATTCCTGT 360 
CTGGCCCACAGCCCGCTAGGAGTGGTGGCCAGCCAGGTTGCTGTGGTCAAGCTTGCCACA 42 0 
CTCGAAGACTTCTCTCTGCACCCCGAGTCCCAGATTGTGGAGGAGAACGGGACAGCACGC 4 80 
TTTGAATGCCACACCAAGGGCCTTCCAGCCCCCATCATTACTTGGGAAAAGGACCAGGTG 54 0 
ACCGTGCCTGAGGAGCCCCGGCTCATCACTCTTCCCAAGTGGCTCCTCCAGATCCTAGAT 600 
GTCCAGGACAGTGATGCAGGCTCCTACCGCTGCGTGGCCACCAATTCAGCCCGCCAACGA 660 
TTCAGCCAGGAGGCCTCGCTCACTGTGGCCCTCAGAGGGTCTTTGGAGGCTACCAGGGGG 720 
CAGGATGTGGTCATTGTGGCAGCCCCAGAGAACACCACGGTAGTGTCTGGACAGAATGTA 780 
GTGATGGAGTGCGTGGCCTCTGCTGACCCGACCCCTTTTGTGTCCTGGGTCCGACAGGAT 840 
GGAAAGCCTATCTCCACGGATGTCATCGTTCTCGGCCGGACCAATCTACTCATCGCCAGC 900 
GCGCAGCCTCGGCACTCTGGAGTCTATGTCTGCCGAGCCAACAAGCCCCTCACGCGTGAC 960 
TTCGCCACTGCGGCTGCTGAGCTCCGAGTGCTTGCTGCCCCAGCCATCTCGCAGGCACCC 1020 
GAGGCGOTCTCGCGGACGCGGGC(^GCACCGCGCGOTTCGTGTGCCGGGCGTCCGGGGAG 1080 
CCACGGCCCGCGCTGCACTGGCTGCACGACGGGATCCCGTTGCGACCCAATGGGCGCGTC 1140 
AAGGTGCAGGGCGGTGGCGGCAGCTTGGTCATCACTCAGATCGGCCTGCAGGACGCTGGC 1200 
TACTACCAGTGCGTAGCAGAAAACAGCGCGGGAACTGCCTGTGCCGCTGCGCCCCTGGCG 1260 
GTAGTGGTGCGCGAGGGGCTGCCCAGCGCCCCGACTCGGGTCACAGCCACGCCGCTGAGC 1320 
AGCTCCTCTGTGCTGGTGGCCTGGGAGCGGCCTGAGTTGCACAGCGAGCAAATCATTGGC 1380 
TTCTCTCTTCACTACCAAAAGGCAAGGGGAGTGGACAATGTGGAGTACCAGTTTGCAGTA 144 0 
AACAATGACACCACAG AGCTGC AGGTTCGGGACCTGGAACCCAACACGGATTATGAGTTC 1500 
TACGTGGTGGCCTACTCCCAGCTGGGGGCCAGCCGAACCTCCAGCCCAGCCCTGGTGCAT 156 0 
ACACTGGACGATGTCCCCAGCGCAGCACCCCAGCTTACCTTGTCCAGCCCCAACCCCTCG 162 0 
GACATCAGGGTGGCATGGCTGCCCCTGCCCTCCAGCCTGAGCAATGGACAGGTGCTGAAG 168 0 
TACAAGATAGAGTACGGTTTGGGGAAGGAAGATCAGGTTTTCTCCACCGAGGTGCCTGGA 1740 
AATGAGACACAACTTACGTTAAACTCACTTCAGCC^VACAAAGTGTACCGAGTCCGGATT 18 00 
TCAGCTGGCACTGGCGCTGGCTATGGAGTCCCTTCTCAGTGGATGCAGCACAGGAC ACCT 186 0 
GGTGTGCACAACCAGAGCCATGTTCCCTTTGCCCCTGCAGAATTGAAGGTGAGGGCAAAG 1920 
ATGGAGTCCCTGGTGGTGTCATGGCAGCCGCCCCCTCACCCCACCCAGATCTCTGGATAC 1980 
AAACTCTACTGGGGAGAGGTGGGAACAGAGGAGGAGGCAGATGGTG ACCGCCCCCCAGGG 204 0 
GGTCGTGGAGATCAAGCTTGGGACGTCGGGCCCGTGCGGCTGAAGAAGAAAGTGAAGCAG 2100 
TATGAACTGACCCAGTTAGTCCCTGGCAGGCCGTACGAGGTGAAGCTCGTAGCTTTCAAC 2160 
AAACACGAGGACGGCTACGCTGCTGTGTGGAAGGGCAAGACGGAGAAGGCGCCCACGCCA 2220 
GACCTGCCTATCCAGAGGGGGCCACCGCTGCCTCCTGCCCATGTCCACGCAGAGTCAAAC 2280 
AGCTCCACTTCCATTTGGCTTCGGTGGAAGAAGCCAGACTTTACCACTGTCAAGATTGTC 2340 
AACTAC ACTGTACGCTTCGGC C C CTGGGGGCTCAGG AATGC TT C C CTGGTC AC CT ACT AT 2400 
ACCAGCTCTGGAGAAGACATTCTC^TTGGCGGCCTGAAACCATTTACCAAGTACGAGTTT 2460 
GCGGTACAGTCCCACGGAGTGGATATGGATGGGCCCTTTGGCTCCGTCGTAG AACGCTCC 252 0 
ACCCTGCCTGACCGGCCTTCAACACCTCCTTCTGACCTGCGCCTGAGCCCCCTGACACCA 2 580 
TCCACCGTTCGGTTACACTGGTGTCCCCCCACGGAGCCCAATGGTGAGATTGTGGAGTAT 2640 
CTAATTCTCTACAGCAACAACCACACCCAGCCCGAACACCAGTGGACACTGCTCACCACA 2700 
GAGGGAAACATCTTCAGTGCAGAGGTCCATGGCCTAGAGAGTGACACTCGGTATTTCTTC 2760 
AAGATGGGAGCCCGCACAGAGGTGGGGCCTGGGCCCTTTTCCCGCTTGCAGGATGTGATT 282 0 
ACTCTGCAAGAGACATTCTC AGACTCCTTGGATGTGCACGCCGTCACGGGCATCATCGTG 2 880 
GGTGTCTGCCTGGGCCTTCTCTGCCTCCTGGCCTGCATGTGTGCTGGCCTACGACAAAGC 294 0 
TCCCACAGGGAAGCCCTTCCCGGATTGTCCTCCTCAGGCACCCCAGGAAACCCAGCGCTC 3 000 
TACACAAGAGCTCGGCTTGGGCCTCCCAGTGTCCCTGCTGCCCATGAGTTGGAGTCCCTC 306 0 
GTGCATCCTCGTCCCC AGGATTGGTCCCCACCACCCTCAGATGTGGAAGACAAGGCTGAA 3120 
GTACACAGCCTTATGGGTGGCAGTGTTTCAGATTGCCGGGGCCACTCCAAG AGAAAG ATC 3180 
TCCTGGGCTCAGGCAGGGGGACCAAACTGGGCAGGCTCCTGGGCAGGCTGTGAGCTGCCC 324 0 
CAGGGTAGTGGTCCAAGGCCGGCTCTGACCCGTGCTCTGCTGCCTCCAGCGGGAACCGGG 3300 
CAGACACTGCTGCTGCAAGCCCTGGTGTATGACGGCATAAAGAGCAACGGGAGAAAGAAG 3360 
CCGTCCCCAGCCTGCAGGAATCAGGTGGAAGCTGAGGTCATTGTCCACTCCGACTTCGGT 3420 
GCATC CAAAGG ATGTC CTG ACCTC C ACCTCCAAG ACCTGGAGC C AG AGG AACC ACTG ACT 34 80 
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GCAGAGACTCTGCCTTCCACGTCTGGAGCTGTGGATCTGTCTCAAGGAGCAGACTGGCTG 3540 
GGCAGGGAGCTGGGAGGGTGC C AAC CAACAACCAGTGGGC CAGAGAGGCTCACCTGCTTG 3600 
CCAGAAGCAGCCAGTGCCTCCTGCTCCTGCTCAGACCTCCAGCCCAGCACTGCTATAGAG 3660 
GAGGCCCCTGGGAAAAGCTGCCAGCCCAAAGCCCTGTGTCCTCTAACAGTCAGCCCAAGC 372 0 
CTTCCCAGGGCCCCTGTCTCCTCTGCTCAGGTCCCCTGAGCAGAAGGCAGATATGGCTCA 3780 
GGAACATGCCATGCATGGCTACACATGTGTGTACTAGAGATATCCATAAGTCCTTGGAGC 3840 
CTCTTAGGGTCCTTTGGCTGGGGTTGGGGAGAACTTTACTCTCCCTCATATTCTGCATCA 3 900 
CATACAGGAGGGACTTGAGACACAGCTCTGTGTAATGGACACGTGTGAAGTCGTGTGTGT 3960 
GTGTGTGTGTGTGTGTGCTGGTTGAGCTAGGAAACCTCTCCCTATGTAGCACTCACTGTG 4020 
GCCTAGTTGACCCTCCGTGGCAGGATGGTGTAACAGTGATCAGTGCCAGCTCTTTGAGCT 4080 
TTTAGCCTTGTCACCTAGCCTTTTATTACACTCTGAGAGTGTCTCCAGTGCTGTGTCTAC 4140 
AAAGACAGCGCCCAGCCCTCTTCTGTCAGCTGTGCTGAGCAGAGTGCCAGTCAACTCCAC 4200 
GGGCCTATGACACCGCAGCCTACCACAGCATGGCTGTCATCCCCCTGGCCTCCTAAGGTC 4260 
CAGATGTCTGGGTGAACCCAGCTCAGCTCCCCTCTCCTTTGAGCATCTCTGTACCTAATT 4320 
TTGTAATCTGGGAAGTGCCTGGTTTGGGAAATCTTCTTTCGCACCCTGTCCCTCTCTGCC 4380 
CCTTCCTTCATTTGTTCTGGTGATCTGTCTCATGTCATCTTGCTCGATTATCCTGGGGCC 4440 
CTTCTCTTTCCCATGATGC CCCTGATTTC CTCACTGCTGTTTTCATTTCTGT CTGCCATG 4500 
CTTGTCTTTATGTCGTGTGTTTCTCGTCCCTGAGTTCAACCTATGCACCCTTTCCTAACA 4560 
ACATGACTACCTCATGTCTGCTTCAGACCATAGTGTGACCCCTGGGTCCCCACAGCTCCC 4620 
CTGCCAACCGCCTTCCTGGGCAGATGAGCCCACTCCAAGTAGATCTGGAAAAGACCCTTG 468 0 
TGGCTTGTCTGGCTGCCCTCCCCTTGGTGTTGAGATGAGAAGGTTTTCTATGGAAGAGAT 4740 
GAGTCCAGGCTGCACAGGGGAACCCCCAAGAAGGGGTAGGGAGTGAAACCAAGAGGCTGA 4800 
AAAAAAATGGCTGCCACCCATCTGCACAGAGAGATGGGTGTGTGCTTTTGACGTGCAGTC 4860 
CTGGCTGAAACTGAAGGGGTGAGGAGAGGGGAGCTACTGGGGCTGCCATGGCTCAGTTCC 492 0 
CTGACCCTGGAGCCCTGAACCTGGCTTCAGAGTAGCAAAGAGTTTCCTCCAAGATGCTGT 4980 
AAGGGAAGTCTTTGCATAGGAAAAGGGCGGCTGGCTCATTTTATTTTATCTTTCTTTACA 504 0 
CTGAATCCCAAAATCATCTTACCACAAAGGGCCAAGCCTGACTGGTATTTCCTGAGTCAC 5100 
AAGAGCC ATGC CATCTCT CTGGTTTCTCACC TCAGTCATGTCC CAGAATTGTCAGATCC A 5160 
GTGGCATCTGTGCTCTTGCTGCACATCTTTCTATTTCAACTGGCTGGCACATCAAGTGTT 5220 
AACTCTGGCTTCTGGGCCAAGTTAGAAATAACC AGTCTATTTTCCCTTTATTTTATTTTA 5280 
TTTTATTTTATTTTATGTCTTTCAGTGGAGTTGTAGCTTCTGAAAGCGTCTGTGTTTATT 5340 
AGCCTTGTGTGTCACTCATGTTTGACCCCACCCACATTTCCTTCTCCTCCCCTCTTCAGC 5400 
CAGCCTATGATAACACTAAAGATTATTAATGCTGGCTTCGTATCTCATTAAAGACAGGAT 5460 
TGTCACTTGT^ACTACTTCTATAGCATTCAAAGTGGCCACGGCCAACACCACCGTATGTTT 552 0 
CTTCATTGCTCTGAAGGTCAAGAGCCTCATTTTGTTTTCCTGGTTAGATTCTTTTCCTCC 5580 
TTGCCTTGAATGAAATAACCGTTTTAACAGTAGGCTCTTAGCATCACACCACATAGTCAT 564 0 
TCCTCATGTTCTTGTTTAACAAGCACTTGAGGTTCTGGGTTTAAATTAAATAGCTGCAAA 57 00 
TGAGACAATTTATAACCCATTAGGCTGGGTGGAAAATTGTTCTC AAAAGCAAATAAGTAA 5760 
TAAATCTGGTATCTGCCTATAACTCACAGTTGATAAGAAAGTAGCCAGAACTCACTAGCA 5820 
TTATATATGATTGGGGTTCTGAGTAACTGGGGAGTGTTAGCTTTGTGACTTTGTAGCACC 5880 
AGGTCTTATTAGGAAAGTCTGTTGGCCTTTTACAGGGCATTAGTCCCTTTGTCGTTTGCC 5940 
ATGGATGCCTTAAGTTCTTTGGAGTCTCATTTAAGAATTCCTTTTCTCGAAGCATGACAA 6000 
GTGTATCGCAATACTTACATGCTCACTCGTTTACCTGGCTTAGTTTGTGCTGGGTTATTT 6060 
AATTGCACTTTCCAGCATCATGCTTCCTCCTTACAAATATGATATTCTTTATTGTTACAC 612 0 
TAAGGTGTTGATCATGTATCTGTCCCTGTAAAGAATTAATAAACTATTTTCCAGAC 6176 
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MARftDTGRGLLVLTFCLLSARGELPLPQETTVKLSCDEGPLQVILG PEQAVVLDCTLGATAAGPPTRVTWSKDGDTVLEH 
ENLHLLPNGSL WLSSPLEQEDSDDEEALRIWKVTEGSYSCLAHSPLGW ASQVAWKLATLEDFSLHPESOIVE ENGTAR 
FECHTKGLPAPIITWEKDQVTVPE EPRLITLPKWLLOILDVQDSDAGSYRCVATNSARO RFSOFASI TV A I fatbc 

QPWIVAAPENTTW SGQNWHECVASADPTPFVSWVRQDGKPISTDVIVLGRTNLLIASAQPRHSGVYVCRANKPLTR D 
FATAAAELRVLAAPAISQAPEALSRT RASTARFVCRASGEPRPALHWLHDGIPLRPWGRVKVQGGGGSLVITQIGLQDAG 
YYQCVAENSAGTACAAAPLAWVREGLP SAPTRVTATPLSSSSVLVAWERPELHSEQIIGFSLHYQKARGVDWVEYQFAV 
NNDTTELQVRD LEPNTDYEFYVVAYSOLGAS RTSSPALVHTLDDVP SAAPQLTLSSPNPSDIRVAWLPLPSSLSNGOVLK 
YKIEYGLGKED QVFSTEVPGNETQLTLNSLQPNKVYRVRISAGTGAGYGV PSQWHQHRTPGVHNQSHVP FAPAELKVRAK 
HESLWSWQPPPHPTQISGYKL YWGEVGTEEEADGDRPPGGRGDOAWDVGPVRLKKKVKQYELTQLVPGRPYEVKLVAFN 
KHEDGYAAVWKGKTEKAPTPDLPIQRGPP LPPAHVHAESNSSTSIWLRWKKPDFTTVKIVNYTVRFGPWGLRNASLVTYY 
TSSGEDILIGGLKPFTKYEFA VQSHGVDHDG PFGSWERSTLPDRP STPPSDLRLSPLTPSTVRLHWCPPTEPNGEIVEY 
LILYSNNHTQP EHQWTLLTTEGNrF5AEVHGLESDTRYFFKHGARTEVGPG PFSRL0DVITLOFTFSn<;inVH «BCTER 

MHM^M^RQSSHREALPGLSSSGTPGNPALYTRARLGPPSVPAAHELESLVHPRPQDWSPPPSOVEOKAE 
VHSLMGGSVSDCRGHSKRKISWAQAGGPNWAGSWAGCELPQGSGPRPALTRALLPPAGTGQTLLLQALVYDGIKSNGRKK 
PSPACRNQVEAEVIVHSOFGASKGCPDLHLQDLEPEEPLTAETLPSTSGAVDLSQGADWLGRELGGCQPTTSGPERLTCL 
PEAASASCSCSOLQPSTAIEEAPGKSCQPKALCPLTVSPSLPRAPVSSAQVP 



FIGURE 2B 



10 20 30 40 50 60 

1 I I 1 I I 

1 AGGCTGGTGGCGCGCGGGCGCGTGTCCCCTGTGGTGCAGGGTGGCCACACTGGCGGGGCG 

6 1 CCCCCGCGTGGGCCGCTAGCCCAAGATGGCGATGGAGGGGCGGGCGAGCTGGCCGCGGCC 

121 CCGGCCCCCGCGCCGGCCCCCGCTCGGCCCCGGCCCCGGAGGCCCGCGCCCCGCCCGCGG 

181 CGCCGCGCCTCCCGGAGCCACTGACGCCCGGCGCGCCCTCCCCCGGCGGCGGCCCAGGCG- 

MetAlaArgAlaAspThrGlyArgG 
241 CCCGGACGCGGCGGCAGCGGCCCGAGCCCGGCCC TATGGCGCGGGCGGACACGGGCPOrn 

splice 
site 

| intron 1 >> 
lylieuLeuValLeuThrPheCysIieuLeuSerAlaArg j 
301 GGCTCCTGG TGCTGACCTTCTGCCTGCTGTCCGCGCGCG GT^GGGCrrr^TnnPrniPn 

361 GTCGCGAGTGGGCGTCCCCGGCGCCCGCGATGCTTGCGCGCCGGGGGCTGTGGGGACTTG 
421 CCCCCAGGGGGTGTGTGTCCTTGCTGTGCACAGCCTGGCACCGTGCGTGTCCCCCTGCGC 
481 GTGGCCCTTGTGCATGTGAG 
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